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abstract: Coronavirus nucleocapsid (N) protein envelops the genomic RNA to form long helical 
nucleocapsid during virion assembly. Since N protein oligomerization is usually a crucial step in this 
process, characterization of such an oligomerization will help in the understanding of the possible 
mechanisms for nucleocapsid formation. The N protein of severe acute respiratory syndrome coronavirus 
(SARS-CoV) was recently discovered to self-associate by its carboxyl terminus. In this study, to further 
address the detailed understanding of the association feature of this C-terminus, its oligomerization was 
systematically investigated by size exclusion chromatography and chemical cross-linking assays. Our results 
clearly indicated that the C-terminal domain of SARS-CoV N protein could form not only dimers but 
also trimers, tetramers, and hexamers. Further analyses against six deletion mutants showed that residues 
343—402 were necessary and sufficient for this C-terminus oligomerization. Although this segment contains 
many charged residues, differences in ionic strength have no effects on its oligomerization, indicating the 
absence of electrostatic force in SARS-CoV N protein C-terminus self-association. Gel shift assay results 
revealed that the SARS-CoV N protein C-terminus is also able to associate with nucleic acids and residues 
363—382 are the responsible interaction partner, demonstrating that this fragment might involve genomic 
RNA binding sites. The fact that nucleic acid binding could promote the SARS-CoV N protein C-terminus 
to form high-order oligomers implies that the oligomeric SARS-CoV N protein probably combines with 
the viral genomic RNA in triggering long nucleocapsid formation. 


Severe acute respiratory syndrome (SARS) is an emerging 
infectious disease caused by SARS coronavirus (SARS- 
CoV)' {1—4). SARS-CoV is a novel type of Coronaviridae 
(5) and is moderately related to the other known corona- 
viruses (6). It contains several structural proteins, namely, 
spike protein (S), small envelope protein (E), membrane 
protein (M), and nucleocapsid protein (N) (6, 7). SARS-CoV 
N protein is one of the most abundant stractural proteins 
and performs many biological functions. It may bind to the 
structure M protein, forming a virion {8, 9), and interact with 
the heterogeneous nuclear ribonucleoprotein A1 (huRNP Al), 
playing an important role in the process of mRNA synthesis 
(70, 11). SARS-CoV N protein inhibits the activity of the 
cyclin—CDK complex and blocks S phase progression in 
mammalian cells (72). Recently, it was even found to be 
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able to activate the expression of cyclooxygenase-2, resulting 
in inflammation (75). 

It is known that coronavirus N protein is able to encapsi- 
date RNA into ribonucleoprotein (RNP) or a long helical 
nucleocapsid structure (74, 75), and SARS-CoV N protein 
is suggested to play an essential role in viral RNA packaging 
(76). Self-association of N protein is an important step within 
virus particle assembly for many viruses (75, 17—19). The 
detailed oligomerization study has revealed that the full- 
length SARS-CoV N protein is able to form high-order 
oligomers but exists predominantly as dimers (20-24). The 
self-association domain in SARS-CoV N was first mapped 
out as the C-terminal 209 amino acids using the yeast two- 
hybrid (23). Later this domain was shortened to the C- 
terminal 138 (residues 285—422) (20) and 140 (residues 
283—422) amino acids (25) by two independent groups. Yu 
et al. reported that residues 285—422 of SARS-CoV N 
formed only dimers ( 20 ), while our previous work revealed 
that residues 283—422 formed not only dimers but also 
multimers (25). This illegibility needs more exploration to 
clarify. In addition, NMR structural analysis and the crystal 
structure revealed that parts of the C-terminus, residues 248— 
365 and residues 270—370, are able to form dimers (26, 27); 
however, the relevant oligomeric property of the whole 
C-terminus is still poorly characterized, and the necessary 
sequence for this oligomerization remains unknown. In this 
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work, the oligomerization of the SARS-CoV N carboxyl 
terminus was systematically investigated by various bio¬ 
physical and biochemical analyses. Size exclusion chroma¬ 
tography and chemical cross-linking assays revealed that the 
C-terminal 140 amino acids (residues 283—422, N 283 - 422 ) 
were able to form dimers, trimers, tetramers, and hexamers 
depending on the protein’s concentration. Analysis of SARS- 
CoV N protein deletion mutants further identified that 
residues 343—402 are important for N 283-422 oligomerization. 
Although this fragment contains many charged residues, 
changes in ionic strength could not affect its oligomerization, 
which thus indicates that the self-association of the C- 
terminal domain of SARS-CoV N protein is not related to 
electrostatic forces. Additionally, DNA band shift assays 
revealed that N 283-422 is able to interact with nucleic acids, 
and the nucleic acid binding region was mapped as residues 
363—382. The fact that ssDNA binding could promote 
N 283-4222 to form high-order oligomers implies that it is 
possible that the oligomeric SARS-CoV N protein might 
combine with viral genomic RNA to generate higher order 
oligomers, which in turn triggers the formation of the long 
nucleocapsid structure. 

EXPERIMENTAL PROCEDURES 

Plasmid Construction. For construction of the related 
truncated SARS-CoV N proteins, the coding sequences for 
the C-terminal 283-422, 303-422, 283-402, 283-382, 
263-362, 243-342, and 343-422 residues of SARS-CoV 
N proteins were amplified from the full-length SARS-CoV 
N gene with the following primers: N283-422 sense 5'- 
AATTGGArCCACCCAAGGAAATTTCGGGGACCAA- 
3', N283-422 antisense 5'-ACGGG7’CGACTTATGCCTGAGT- 
TGAATCAGCA-3', N303-422 sense S'-AATTGGATCCCCGC- 
AAATTGCACAATTTG-3', N303-422 antisense 5'-GGC- 
CAAGC7TTTATGCCTGAGTTGAATC-3', N283-402 sense 
5'-ACGTCA 7A TG ACCC AAGG AAATTTCGGGG ACC A- 
3', N283-402 antisense S'-TCAAGGATCCTTAATCCATCT- 
CAGCCGCAGGAA-3', N283-382 sense S'-ACGTCATATG- 
ACCCAAGGAAATTTCGGGGACCA-3, N283-382 antisense 
S'-AATTGGATCCTTAAGGCTGAGCTTCATCAGTCT- 
3', N263-362 sense 5'-CCTTCArArGCGTACTGCCAC- 
AAAAC-3', N263-362 antisense S'-CCGTGGArCGTTATTT- 
GTATGCGTCAA-3', N243-342 sense S'-AATTGGATCC- 
CAAGGCCAAACTGTCAC-3’, N243-342 antisense 5^-GGCC- 
GAA7TCTCAGTCATCCAATTTAATG-3', N343-422 sense 
5'-ACCGGGArCCAAAGATCCACAATTCAAAG-3', and 
N343-422 antisense S'-GGCCGAArrGrTATGCCTGAGT- 
TGAATC-3'. The PCR products for N283-422 were digested 
by BamHl and Sail and ligated into pQE30 expression 
vectors (Qiagen). The PCR products for N303-422 were 
digested by BamHl and Hindlll and ligated into pET28a 
expression vectors (Novagen). The PCR products for N283-402, 
N283-382, and N263-362 were digested by Ndel and BamHl and 
ligated into pETlSb expression vectors (Novagen). The PCR 
products for N243-342 and N343-422 were digested by BamHl 
and EcoRI and ligated into pGEX4T-l expression vectors 
(Amersham Parmacia Biotech). The constructs were identi¬ 
fied by PCR reactions with the corresponding primers and 
digested with the restriction enzymes. The sequences of all 
the constructs were further confirmed by DNA sequencing. 
All of the constructs were transformed into DH5a competent 
cells and kept in 20% glycerol at —80 °C. 


Expression and Purification of His-Tagged Proteins. 
The C-terminal domain (N 283 - 422 ) of SARS-CoV N protein 
was cloned into a pQE30 vector with an N-terminal Hise- 
tag and expressed under the control of the T5 promoter. 
The constructed pQE30—N 283-422 recombinant plasmid was 
transformed into Escherichia coli strain M15 and inoculated 
into 10 mL of LB medium containing 100 /rg/mL ampicillin 
and 50 /rg/mL kanamycin at 37 °C. The 10 mL culture 
was transferred into 1 L of LB medium with the same 
antibiotics 10 h later, and the culture was grown at 37 °C 
until the ODeoo (optical density at 600 nm) reached 0.8. 
IPTG (isopropyl /1-D-thiogalactopyranoside) was then added 
to a final concentration of 1.0 mM, and the induction was 
allowed to proceed at 37 °C for 10 h. The E. coli cells were 
harvested by centrifugation at 8000g for 10 min and stored 
at -80 °C. 

The recombinant plasmids pET28a—N 303 - 422 , pET15b— 
N283-402, pET15b-N283-382, and pET15b-N263-362 were 
transformed into E. coli strain BL21(DE3)pLysS. A single 
colony was inoculated into 10 mL of LB medium supple¬ 
mented with ampicillin (final concentration 100 /tg/mL). The 
10 mL culture was transferred into 1 L of LB medium with 
the same antibiotics 10 h later and grown at 37 °C with 
vigorous shaking. When the OD^oo reached 0.7, the expres¬ 
sion of Hise-tag fusion protein was induced by the addition 
of IPTG to a final concentration of 0.5 mM, followed by 5 
h of shaking at 25 °C. The E. coli cells were harvested by 
centrifugation at 8000g for 10 min and stored at —80 °C. 

All the His^-tag fusion proteins were purified by Ni—NTA 
affinity chromatography according to similar protocols. In 
brief, cell pellets from 0.5 L of culture were suspended in 
20 mL of sonication buffer (20 mM Tris—HCl buffer at pH 
8.0, 500 mM NaCl, 10 mM imidazole, 20 /tM PMSE 
(phenylmethylsulfonyl fluoride)) and disrupted by sonication 
for 30 min in an ice bath. Soluble and insoluble fractions 
were separated by centrifugation for 45 min at 14000 rpm 
and 4 °C. The supernatant was incubated with preequilibrated 
Ni—NTA resin (Qiagen) in sonication buffer by rotating 
constantly (15 rpm) at 4 °C for 3 h. The unbound proteins 
were removed by washing with 10 mL of sonication buffer. 
The column was washed with 30 mL each of washing buffers 
I (20 mM Tris-HCl buffer at pH 8.0, 500 mM NaCl, 60 
mM imidazole) and II (20 mM Tris-HCl buffer at pH 8.0, 
500 mM NaCl, 100 mM imidazole). The protein of interest 
was then eluted with elution buffer (20 mM Tris-HCl buffer 
at pH 8.0, 500 mM NaCl, 500 mM imidazole). Tractions 
were collected, and the recombinant protein in the eluted 
fractions was confirmed by 10% (w/v, the mass concentration 
of the acrylamide/bisacrylamide (29:1) mixture in the sep¬ 
arating gel) SDS—PAGE analysis. 

Expression and Purification of GST Fusion Proteins. The 
expression and purification procedures for GST-tag fusion 
proteins were carried out according to instructions from 
Amersham Pharmacia Biotech with some modifications. The 
constructs of pGEX4T-l—N 243-342 and pGEX4T-l—N 343-422 
were transformed into BL21(DE3), and the transformants 
were grown at 37 °C overnight in LB medium in the presence 
of ampicillin (100 /ig/mL). Bacterial cultures were diluted 
with fresh LB medium (plus 100 /rg/mL ampicillin) and 
incubated for 4 h at 37 °C. Eor 7 h with the induction of 
IPTG (final concentration at 0.8 mM) at 30 °C, the bacterial 
cultures were collected by centrifugation, resuspended in PBS 
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buffer (140 mM NaCl, 2.7 mM KCl, 10 mM Na 2 HP 04 , 1.8 
mM KH 2 PO 4 , pH 7.4) containing 20 /rM PMSF, and lysed 
by sonication for 30 min in an ice bath. The cell lysates were 
clarified by centrifugation at 14000 rpm at 4 °C for 60 min. 
The supernatant was mixed with 3 mL of PBS preequilibrated 
glutathione—Sepharose beads (Amersham Pharmacia Bio¬ 
tech) on a rotating bed at 4 °C for 3 h. A 100 mL sample 
of prechilled PBS buffer was used to wash the beads to 
eliminate the loose-binding contaminants. The bound proteins 
were then cleaved on-column to get rid of GST-tag by 5 
units of thrombin (Amersham Pharmacia Biotech) in 5 mL 
of PBS buffer at 16 °C for 12 h. The digested products were 
then further purified by size exclusion chromatography 
(Hiload 16/60 Superdex 75). Fractions were collected, and 
the presence of recombinant protein in the eluted fractions 
was confirmed by 10% SDS—PAGE analysis. 

Size Exclusion Chromatography Based Assay. The olig¬ 
omeric features of the purified fragments of SARS-CoV N 
were analyzed by size exclusion chromatography using 
HiLoad 16/60 Superdex 75 (separation range 3—70 kDa) and 
Hiload 16/60 Superdex 200 (separation range 10—600 kDa) 
on an AKTA FPLC platform (Amersham Pharmacia Bio¬ 
tech). The column was equilibrated at a flow rate of 1 mL/ 
min with PBS buffer at room temperature. The protein 
sample (500 fiL) was injected at a given concentration and 
detected by the absorbance at 280 nm. The column was 
calibrated using a mixture of bovine serum albumin (67 kDa), 
ovalbumin (45 kDa), chymotrypsinogen A (25 kDa), and 
ribonuclease A (13.7 kDa) as a reference. These standard 
proteins were purchased from Amersham Pharmacia Biotech. 
The N283-422 oligomerization was studied according to the 
procedure modified from that of HIV Vpr (28). 

Chemical Cross-Linking Assays. All the tested mutants 
were purified by gel filtration column and digested by 
DNAase and RNAase to get rid of the possible binding 
nucleic acids. Glutaraldehyde (25%, Sigma) was diluted 
to a series of concentrations (1%, 2%, 4%, 6%, 8%, 10%) 
by distilled water. The proteins in PBS buffer were reacted 
with glutaraldehyde on an ice bath for 20 min. Dimethyl 
pimelimidate dihydrochloride (DMP) was resolved in dis¬ 
tilled water at 300 mM. Samples were exchanged into 
PBS buffer and treated with 0—30 mM DMP for 60 min at 
room temperature. The reactions were quenched by an 
equal volume of SDS—PAGE loading buffer, followed by 
heating at 99 °C for 5 min. To evaluate the effects of ionic 
strength differences on N 283-422 oligomerization, the purified 
protein was prepared by different sets in 10 mM HEPES 
buffer at pH 7.4 containing different salts (NaCl, KCl, and 
CaCL). 

Native PAGE Assays. To investigate the molecular mass 
of N 283-422 in the native state, native polyacrylamide gel 
electrophoresis (native PAGE) was performed in 5% stacking 
gel (pH 9.5) and 10% separating gel (pH 9.5). The purified 
protein sample was electrophoresed at 120 V for 3 h. After 
electrophoresis, the gel was stained with Coomassie Brilliant 
Blue R-250 and then destained. The molecular mass of 
N283-422 was further estimated according to the following 
protein standards: bovine serum albumin (67 kDa), oval¬ 
bumin (45 kDa), chymotrypsinogen A (25 kDa), and ribo¬ 
nuclease A (13.7 kDa). 

Gel Shift Assays. To test whether the purified proteins 
contain nucleic acids, the samples were electrophoresed on 
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0.8% agarose gel in TAE buffer (40 mM Tris—acetate buffer 
at pH 8.0, 1 mM EDTA) with the presence of ethidium 
bromide (0.5 pg/mL final concentration) at low voltage (3 
V/cm). The electrophoretic result was directly visualized 
using a short-wavelength UV transilluminator for nucleic acid 
detection, and the same gel was then stained with Coomassie 
Blue R-250 for protein visualization. In the investigation of 
the nucleic acid binding activity for N 283 - 422 , the purified 
proteins were digested with DNAase and RNAase to get rid 
of the possible binding nucleic acids. ssDNA (single-strand 
DNA, 300—400-mer) was purchased from Sigma and 
resolved in doubly distilled water (2 mg/mL) as a stock 
solution. The RNA was extracted from SD rat liver cells 
with a Trizole kit (Sangon). The purified proteins without 
nucleic acids (estimated by OD 260 /OD 280 ) were prepared at 
1 mg/mL in PBS buffer and then mixed with ssDNA or RNA 
at a molar ratio from 2:1 to 1:3 (protein:nucleic acid). These 
mixtures were incubated at 4 °C for 1 h before analysis by 
0.8% agarose gel. 

RESULTS AND DISCUSSION 

Oligomerization Characterization of the SARS-CoV N 
Protein C-Terminus (N 283 - 422 )- The oligomerization of the 
full-length SARS-CoV N protein has been studied widely 
(20—24), and its carboxyl terminal was proved to be 
necessary for its self-association (20, 23). It was discovered 
previously that the C-terminal domain is able to form 
multimers (25), but it was not clear how high the formed 
oligomers could be, and the primary sequence for the 
oligomerization remained unclear. In this work, to investigate 
the oligomeric properties of the C-terminal domain (residues 
283—422, N 283 - 422 ), the gel filtration (Figure 1), chemical 
cross-linking, and native gel electrophoresis technology based 
assays (Figure 2) were applied. 

In the gel filtration assay, N 283-422 samples at concentra¬ 
tions ranging from 8 to 469 //M were applied to a Hiload 
Superdex 200 column and eluted by 180 mL of PBS buffer. 
A subset of these elution profiles is shown in Figure lA. It 
is noticed that, for each case, a high peak (around 46 mL) 
always appears close to the void volume. On the basis of 
their large Stokes radii and invariant behaviors, we assumed 
that this peak probably corresponds to a large, inert aggregate 
of N 283 - 422 - As also indicated in Figure lA, at low concentra¬ 
tion (8 /rM), N 283-422 protein was eluted in two separated 
peaks. One is close to the void volume, and the other is at 
83 mL, which is attributed to a dimer on the basis of the 
standard curve (Figure IB). Interestingly, with an increase 
of the protein concentration to 15, 31, and 44 /rM, the 
corresponding retention volume for the second peak de¬ 
creased (Figure 1) corresponding to a trimer as determined 
from the standard curve in Figure IB. At higher concentra¬ 
tions (>55 /rM), such a peak shifted to a tetrameric status 
(Figure 1). However, at even higher concentrations (>75 
/rM), the protein sample started to produce a third peak 
around 66 mL apart from retaining the second peak 
characteristic of the tetramers. By comparison with the 
standard curve (Figure IB), this third peak was attributed to 
the molecular size of a hexamer. As the concentration 
continuously increased (from 166 to 469 /rM), the peak 
characteristic of a hexamer increased significantly in con¬ 
centration, while the tetrameric peak increased slowly (Figure 
1), which implied that one tetramer associated with two 
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Figure 1: Size exclusion chromatography based analysis of 
N 283 - 422 - (A) Elution profiles of pure N 283-422 on Superdex 200 at 
the indicated concentrations. (B) Standard curve generated by 
protein markers showing the molecular masses of N 283-422 at the 
indicated concentrations. The molecular mass standards are indi¬ 
cated as follows: bovine serum albumin, 67 kDa; ovalbumin, 45 
kDa; chymotrypsinogen A, 25 kDa; ribonuclease A, 13.7 kDa. 

monomers or one dimer into a hexamer, and the tetramer 
and hexamer might be the two concomitant components 
during the virion assembly. 

In the chemical cross-linking assay, if two proteins 
physically interact with each other, they can be covalently 
cross-linked by using bifunctional reagents containing reac¬ 
tive end groups that react with functional groups (primary 
amines or sulfhydryls). The formation of cross-links could 
be analyzed by the common SDS—PAGE, and it is direct 
and convincing evidence of the existence of oligomers. Here 
the cross-linking experiments were performed using glut- 
araldehyde, a short self-polymerizing reagent mostly reacting 
with the e-amino groups of lysine, and DMP, a more specific 
reagent which cross-links lysine residues at shorter distances. 
As shown in Figure 2 A,B, the recombinant N283-422 migrated 
as a single band (lane 2 ) with a relative molecular mass 
around 17 kDa. The cross-linking experiments by increasing 
the concentration of glutaraldehyde and DMP showed that 
N283-422 was able to form dimers, trimers, and tetramers 
(Figure 2 A,B). Furthermore, the native electrophoresis assay 
also detects the presence of trimeric and tetrameric N283-422 
(Figure 2 C). Accordingly, these data indicated that N283-422 
could form dimers, trimers, and tetramers in solution in a 
concentration-dependent manner. 
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Figure 2: Chemical cross-linking and native gel electrophoresis 
analysis of N 283 - 422 - (A) Cross-linking results of N 283-422 with 
glutaraldehyde. N 283-422 was incubated with the indicated concen¬ 
trations of glutaraldehyde for 20 min on an ice bath. (B) Cross- 
linking results of N 283-422 with DMP. N 283-422 was incubated with 
the indicated concentrations of DMP for 20 min at room temper¬ 
ature. The cross-linked samples were run on 8% SDS—PAGE and 
visualized by Coomassie Blue staining. Cross-linking intermediates 
representing monomer, dimer, trimer, and tetramer forms of N 283-422 
are indicated. The values on the left are molecular masses of the 
protein markers. (C) Analysis of the oligomeric state of N283-422 
by native PAGE assay. A 40 fig sample of protein was electro- 
phoresed and stained with Coomassie Blue R-250. Protein standards 
used for molecular mass evaluation were run on the same gel. 

Mapping the Oligomeric Domain of the SARS-CoV N 
Protein Carboxyl Terminus. To determine the primary 
sequence requirements for N283-422 self-association, six 
truncated fragments of SARS-CoV N protein (Figure 3 A) 
were expressed in an E. coli strain and purified by affinity 
chromatography. N303-422. N283-402. N283-382. N263-362 

were purified by a His^-tag constructed at the N-terminus. 
N243-342 and N343-422 were engineered as GST-tag fusion 
proteins to aid expression and purification. The GST-tag was 
removed by thrombin (protease) digestion before further 
analysis. The oligomeric status of each fragment was probed 
at first by size exclusion chromatography (Figure 3 B). Every 
tested sample was prepared in PBS buffer at 150 /rM 
followed by analysis on an FPLC platform using a HiLoad 
Superdex 75 column. N303-422 and N283-402 are able to form 
trimers and tetramers (Figure 3 B), indicating that residues 
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Figure 3: Mapping of the oligomeric domain in SARS-CoV N protein carboxyl terminus. (A) Schematic description of the truncated 
mutants and summary of their oligomeric activities determined by chemical cross-linking assay. (B) Size exclusion chromatography based 
analyses for the six truncated mutants. All retention volume graphs were obtained on a Superdex 75 column. Pure proteins (2 mL) at 150 
fiM protein were loaded and eluted by PBS buffer at room temperature. (C) Chemical cross-linking results of the truncated mutants. The 
proteins were reacted with glutaraldehyde on an ice bath for 20 min. After the cross-linking reaction was quenched, the cross-linked samples 
were analyzed by 10% SDS—PAGE. The letters on the right indicate the following; I, monomer; II, dimer; III, trimer; IV, tetramer. 


283—302 and 403—422 might not be involved in the 
oligomerization of the SARS-CoV N protein C-terminus. 
However, the removal of a further 20 amino acids from the 
C-terminus apparently decreased the ability of the protein 
to oligomerize, for N283-382 was eluted only as a single peak 
corresponding to a trimer. To further orientate the oligomeric 
domain, it is necessary to generate and analyze a shorter 
fragment. However, shorter constructs are too small to 
express and purify. Thus, two other truncated fragments, 
N263-362 and N243-342 (Figure 3 A), were prepared by N- 
terminal extension of 20 and 40 amino acids, respectively. 
As shown in Figure 3 B, N263-362 formed only dimers as 
reported ( 22 , 26 ). Remarkably, a further C-terminal 20 - 
residue truncation caused the protein to completely lose its 
oligomeric ability by showing that the N243-342 fragment 
formed only monomers. Hereunto, gel filtration results 
suggested that the fragment of residues 343—402 is likely 
the oligomeric domain. To further confirm this, a fusion 
protein containing this domain, N343-422 (Figure 3 A), was 
prepared, and its oligomeric status was also studied by gel 
filtration assay. The fact that N343-422 was able to form 
trimers and tetramers indicated that residues 343—402 are 
necessary and sufficient for the oligomerization of the SARS- 
CoV N protein C-terminus. 

Being a widely used approach in studying protein oligo¬ 
merization, chemical cross-linking was also used to inves¬ 
tigate the above six truncated fragments. During the assay, 
the purified proteins were incubated with increasing con¬ 
centrations of glutaraldehyde for 20 min in an ice bath. Cross- 
linked samples were run on 10 % SDS—PAGE and visualized 
by staining with Coomassie Blue. As shown in Figure 3 A,C, 


the fragments N303-422 and N283-402 existed as monomers, 
dimers, trimers, and tetramers, N283-382 and N263-362 were 
able to form dimers and trimers, N243-342 existed as only 
monomers, and N343-422 had the ability to form dimers, 
trimers, and tetramers. These results indicated that residues 
343—402 are crucial and sufficient for the oligomerization 
of the SARS-CoV N protein C-terminus, in agreement with 
the results revealed by the gel filtration assay. Furthermore, 
these results have suggested that residues 343—362 and 383 — 
402 are responsible for the formation of dimers and trimers, 
and tetramers, respectively. It is noticed that the gel filtration 
results showed N263-362 forms only dimers, while the chem¬ 
ical cross-linking results revealed it forms both dimers and 
trimers. The trimers’ presence in the cross-linking assay 
might be due to the high reactive activity of glutaraldehyde 
as a linker. 

A Change in Ionic Strength Has No Ejfect on V 283-422 
Oligomerization. As has been reported ( 7 ), the C-terminus 
of SARS-CoV N protein (N283-422) contains a number of 
charged residues, which leads us to assume that SARS-CoV 
N protein probably self-associates by electrostatic forces. 
If so, the differences in ionic strength will affect the protein’s 
oligomeric activity. To evaluate this assumption, the oligo¬ 
meric behaviors of N283-422 at different ionic strengths were 
studied by chemical cross-linking assay, in which the purified 
N283-422 proteins were cross-linked in HEPES buffer con¬ 
taining monovalent and divalent chloride salts. As shown 
in Eigure 4 , N283-422 showed almost the same oligomeric 
activity at different ionic strengths, implying the absence 
of electrostatic forces in the oligomerization of SARS-CoV 
N protein. 
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Figure 4: Effect of ionic strength on the oligomeric activity of 
N 283 - 422 - The pure N 283-422 was prepared in different sets of 10 
mM HEPES buffer at pH 7.4 each containing different salts (NaCl, 
KCl, and CaCL). These samples reacted with 0.03% glutaraldehyde 
on an ice bath for 30 min. After the cross-linking reaction was 
quenched, the cross-linked samples were analyzed by 10% SDS— 
PAGE. 

N283-422 Has the Ability to Associate with Nucleic Acids. 
As discussed above, a high peak was always observed close 
to the void volume when pure N283-422 was analyzed by gel 
filtration assay (Figure 1 A). On the basis of their large Stokes 
radii and invariant behavior, we think these complexes prob¬ 
ably represent large aggregates of N283-422- The absorbance 
in the ultraviolet light range indicated that the OD280/OD260 
value of this peak was 0 . 86 , indicating the presence of a 
high concentration of nucleic acids. To confirm the presence 
of nucleic acids, the sample was further subjected to gel shift 
assays, similar to the established methods for analysis of 
RNA binding to protein ( 20 , 22 , 28 ). During the assay, the 
samples were electrophoresed on an 0 . 8 % agarose gel in the 
presence of ethidium bromide. The nucleic acids in the gel 
were visualized by ultraviolet excitation. As shown in Figure 
5 A, the sample apparently contained a large amount of 
nucleic acids. The same gel was then stained with Coomassie 
Blue for checking protein migration. The p/ (isoelectric point) 
of N283-422 is 9 . 5 , and this protein could not run into the gel 
due to its positively charged state at pH 8 . 0 . In our gel shift 
assay, however, the proteins and nucleic acids were found 
at the same position on the gel, indicating that the interactions 
with negatively charged nucleic acids help the protein run 
into the gel. These results thus suggested the possibility that 
N283-422 had the ability to associate with nucleic acids, which 
further promotes N283-422 to form high-order oligomers. 

Recently, it was reported that the recombinant SARS-CoV 
N protein binds to nonspecific DNA and RNA ( 20 , 22 ). To 
investigate whether the N protein C-terminus is involved in 
the association with nucleic acids, the possible interactions 
between N283-422 protein and DNA as well as RNA were 
studied by gel shift assays. During the experiments, a 
constant amount of N283-422 protein was mixed with nucleic 
acids at different molar ratios. As shown in Figure 5 B, the 
ssDNA and RNA itself migrated farther away from the 
loading well, while the ssDNA/protein and RNA/protein 
mixtures migrated a much shorter distance. Meanwhile, such 
a migration distance (Figure 5 B) increased with the amount 
of ssDNA and RNA, which indicated that this association 
with ssDNA or RNA allowed N283-422 to run further on the 
gel. However, this effect was not seen in BSA control assays. 
Accordingly, our results clearly indicated that the C-terminus 
(N283-422) of SARS-CoV N protein also has the ability to 
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Figure 5: N 283-422 is able to interact with nucleic acids. (A) The 
peak at the void volume contains protein and a high concentration 
of nucleic acids. (B) The N 283-422 interaction with ssDNA and RNA 
was studied by gel shift assay. 

associate with nucleic acids without sequence specificity, 
apart from the reported fact that the SARS-CoV N protein 
N-terminus (residues 45 — 181 ) could interact with RNA { 30 ). 
Meanwhile, our result has supported the recent report that 
both N-terminal and C-terminal regions of SARS-CoV N 
protein have binding activities to the viral RNA ( 76 ). 

ssDNA’s Association Promotes N2S3-422 to Form High- 
Order Oligomers. To further confirm whether the association 
with nucleic acids can promote N283-422 to form high-order 
oligomers, the above ssDNA/protein complexes (DNA is 
more stable than RNA) were analyzed by chemical cross- 
linking (Figure 6 A) and gel filtration assays (Figure 6 B). 
The corresponding samples without cross-linking were also 
tested by SDS—PAGE as references. In the absence of 
ssDNA as shown in Figure 6 A, the cross-linked N283-422 
existed as monomers, dimers, trimers, and tetramers. As the 
ssDNA concentration increased, the concentrations of mono¬ 
mers, dimers, trimers, and tetramers decreased, which 
suggested that the binding with ssDNA might promote 
N283-422 to form high-order oligomers. Such a result could 
also be confirmed by gel filtration assays as indicated in 
Figure 6 B. The pure N283-422 was eluted as a single peak 
corresponding to tetramers. When ssDNA was mixed with 
N283-422 at different molar ratios, the peak close to the void 
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Figure 6 : ssDNA binding helps N283-422 form high-order oligo¬ 
mers. (A) Chemical cross-linking results of SSDNA/N283-422 
complexes. Those complexes without glutaraldehyde cross-linking 
were also tested hy SDS—PAGE and used as controls. (B) Gel 
filtration analysis of SSDNA/N283-422 complexes. 


volume appeared and increased with the ssDNA concentra¬ 
tion. In contrast, the tetrameric peak decreased with an 
increase of ssDNA and almost disappeared at a 3:1 (ssDNA: 
N283-422) molar ratio. Hence, these results indicated that 
ssDNA binding could indeed promote the N283-422 to form 
high-order oligomers. Actually, many viral proteins were 
reported to interact with nucleic acid to form high-order 
complexes, for instance, nucleocapsid protein of chandipura 
virus ( 31 ), nucleocapsid protein of vesicular stomatitis virus 
( 52 ), the VP 40 of Marburg virus ( 55 ), and the nucleocapsid 
protein of sindbis virus ( 29 ). 

Residues 363—382 of the SARS-CoV N Protein Carboxyl 
Terminus are Required for Nucleic Acid Binding. To address 
which regions in the carboxyl terminus of SARS-CoV N 
protein were required for nucleic acid binding, six deletion 
mutants were assayed by DNA band shift assays. The pi 
values for N303_422, N283-402, N283-382, N263-362, N243-342, and 
N343-422 are predicted to be 9 . 9 , 9 . 7 , 9 . 5 , 9 . 8 , 10 . 3 , and 9 . 8 , 
respectively. As shown in Figure 7 , N303-422, N283-402, and 
N283-382 could associatc with ssDNA and migrate into the 
gel; however, the removal of a further 20 and 40 amino acids, 
generating N263-362 and N243-343 (Figure 3 A), caused the 
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Figure 7: Mapping of the nucleic acid binding domain by DNA 
gel shift assays. The protein sample was mixed with ssDNA at a 
molar ratio of 1:3. The samples were electrophoresed on 0.8% 
agarose gel in TAE buffer with low voltage. The electrophoretic 
result was directly visualized using a UV transilluminator for nucleic 
acid detection, and the same gel was then stained with Coomassie 
Blue R-250 for protein visualization. 
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Figure 8: Summary of the oligomeric domains and nucleic acid 
binding domain in the carboxyl terminus of SARS-CoV N protein. 


proteins to lose the capability of binding to ssDNA. The 
ability of interacting with ssDNA was restored for N343-422 
(Figure 3 A), as N283-422- These results thus indicated that 
residues 363—382 are responsible for nucleic acid binding 
of the SARS-CoV N protein carboxyl terminus. As demon¬ 
strated in Figure 8, such a region consists of six lysines and 
is likely to be involved in nonspecific interactions with the 
RNA through charge neutralization. 

In conclusion, in this work we have completely discovered 
that the C-terminus of SARS-CoV N protein (residues 283 — 
422 ) could form dimers, trimers, tetramers, and hexamers 
in a concentration-dependent manner. The crucial sequence 
for such oligomerization is determined to be residues 343 — 
402 (Figure 8 ). Although the C-terminus of SARS-CoV N 
protein contains many charged residues, electrostatic force 
was not involved in its oligomerization. Furthermore, it is 
found that the SARS-CoV N protein C-terminus was able 
to associate with nucleic acids, and the interaction domain 
was mapped out as the residues 363 — 382 , in which the six 
lysines are supposed to be responsible for its RNA binding 
(Figure 8 ). Moreover, ssDNA could incorporate N283-422 
tetramers into high-order oligomers. Taken together, these 
data suggested that the overlapping oligomerization and 
nucleic acid binding domain, i.e., residues 343 — 402 , might 
be the most critical sequence for nucleocapsid structure 
formation. Since much of the driving force for nucleocapsid 
assembly comes from protein—protein interactions and 
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protein—RNA interactions involving the C-terminus of 
SARS-CoV N protein, a hypothesis was proposed for the 
nucleocapsid assembly. Self-association of SARS-CoV N 
protein to form oligomers, especially tetramers, is the first 
step of the assembly process. Then these oligomers continu¬ 
ously bind to the genomic RNA in triggering the formation 
of a long nucleocapsid structure. 
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